Contrastive Analysis and Native Language Identification

نویسندگان

  • Sze-Meng Jojo Wong
  • Mark Dras
چکیده

Attempts to profile authors based on their characteristics, including native language, have drawn attention in recent years, via several approaches using machine learning with simple features. In this paper we investigate the potential usefulness to this task of contrastive analysis from second language acquistion research, which postulates that the (syntactic) errors in a text are influenced by an author’s native language. We explore this, first, by conducting an analysis of three syntactic error types, through hypothesis testing and machine learning; and second, through adding in these errors as features to the replication of a previous machine learning approach. This preliminary study provides some support for the use of this kind of syntactic errors as a clue to identifying the native language of an author.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Contrastive Analysis of Metadiscourse Markers Used by Non-native (Iranians) vs. Native (Americans) Speakers in Developing ELT Materials

Metadiscourse is a widely used term in current discourse analysis and language education, referring to an interesting, and relatively new approach to conceptualizing interaction between text producers and their texts and between text producers and users. Despite the growing importance of the term, however, it is often understood in different ways and used to refer to different aspects of langua...

متن کامل

A Contrastive Study of Persian and English Written Discourse: Ellipsis in Realistic Novels

  This study aspires to examine the concept of ellipsis by comparing and contrasting English and Persian written texts. For this purpose, three Persian novels and three English ones were selected. These novels were analyzed carefully; they were compared and contrasted for types and amount of ellipsis used, through a Chi-square analysis.  The results of the data analysis revealed that various t...

متن کامل

Exploiting Parse Structures for Native Language Identification

Attempts to profile authors according to their characteristics extracted from textual data, including native language, have drawn attention in recent years, via various machine learning approaches utilising mostly lexical features. Drawing on the idea of contrastive analysis, which postulates that syntactic errors in a text are to some extent influenced by the native language of an author, this...

متن کامل

A Corpus-Based Contrastive Analysis of Stance Strategies in Native and Nonnative Speakers’ English Academic Writings: Introduction and Discussion Sections in Focus

The present study was an attempt to illustrate the interaction between writers and readers. Conveying of the writers’ voice, stance, and interaction with reader was put forward within this paradigm. Being a good academic writer is highly related to the use of these strategies.  Adopting a position and persuading readers of claims are very important. This study was aimed at showing th...

متن کامل

The Effect of Contrastive Analysis on Iranian Intermediate EFL Learners of L2 Adjective Knowledge

Contrastive  analysis  of  hypothesis  is  the  comparison  of  the linguistic system of two or more languages and it is based on the main difficulties in  learning  a  new  language  that  caused  by  interference  from  the  first  language. The present study intended to investigate the effect of contrastive analysis on Iranian intermediate EFL learners’ knowledge of L2 adjectives. The questi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009